Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition

نویسندگان

  • Xiong Xiao
  • Chng Eng Siong
  • Haizhou Li
چکیده

Histogram equalization (HEQ) is a simple and effective feature normalization technique for robust speech recognition. Recently, we proposed to adapt HEQ transform to each test utterance using a maximum likelihood (ML) criterion and observed improved performance. In this paper, we further the study by applying attribute-based HEQ and its ML adaptation. Instead of applying a global HEQ transform to the test utterance, we propose to apply different HEQ transforms to the 6 manners of speech, e.g. vowel and fricative. We also developed the ML adaptation algorithm of the attribute-based HEQ. Experimental results show that the attribute-based HEQ adaptation obtained 21.8% and 19.5% relative error rate reduction over the global HEQ baseline on the Aurora-2 and Aurora-4 benchmarking tasks, respectively.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition

The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Quite a few techniques have been proposed to improve ASR robustness over the past several years. Histogram equalization (HEQ) is one of the most efficient techniques that have been used to reduce the mismatch between training an...

متن کامل

Improved Histogram Equalization (heq) for Robust Speech Recognition

With the rapid development of Intelligent Transportation Systems (ITS), how to provide users with a natural and efficient humanmachine interface is now becoming a crucial issue for driver safety. It is no doubt that speech will be one of the best mediators of human-machine interaction; however, the performance of automatic speech recognition (ASR) always radically degrades when the input speech...

متن کامل

Sub-Band Level Histogram Equalization for Robust Speech Recognition

This paper describes a novel modification of Histogram Equalization (HEQ) approach to robust speech recognition. We propose separate equalization of the high frequency (HF) and low frequency (LF) bands. We study different combinations of the sub-band equalization and obtain best results when we perform a two-stage equalization. First, conventional HEQ is performed on the cepstral features, whic...

متن کامل

Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition

Histogram equalization (HEQ) of acoustic features has received considerable attention in the area of robust speech recognition because of its relative simplicity and good empirical performance. This paper presents a novel HEQbased feature extraction approach that performs equalization in both acoustic frequency and modulation frequency domains for obtaining better noise-robust features. In part...

متن کامل

Stereo-based histogram equalization for robust speech recognition

Optimal automatic speech recognition (ASR) takes place when the recognition system is tested under circumstances identical to those in which it was trained. However, in the actual real world, there exist many sources of mismatches between the environment of training and the environment of testing. These sources can be due to the sources of noise that exist in real environments. Speech enhanceme...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013